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restricted. In the recent studies, the classical blind source separation 


Blind source separation techniques will not give the perfect solution for the cocktail parity problem. 


Cocktail party problem These methods struggle with convergence speed and accuracy issues as well. 
ICA ; In order to enhance the separation process and get over these issues, this 
Speech separation work adopted the glowworm swarm technique based on kurtosis as the 
Wiener filter objective function. The results show that the proposed technique produces 
good separation. The original and estimated signals are compared for 
similarity using the cross-correlation function. 
This is an open access article under the CC BY-SA license. 
OK 
Corresponding Author: 
Ahmed Kareem Abdullah 


Department of Electrical Engineering techniques, Al-Mussaib Technical College 
Al-Furat Al-Awsat Technical University 

Babil, Iraq 

Email: ahmedalbakri2012 @atu.edu.iq 


1. INTRODUCTION 

The human brain's acoustic system has the ability to pick and focus its attention on a single sound 
source in a noisy and loud setting, which is how the cocktail party problem (CPP) is solved [1]. According to 
prior study, because to the nature and settings of the CPP, it is difficult to come up with common solutions. 
The system cannot distinguish the audio signal or intended sources from interface noise and competing 
signals in an indoor noisy setting. The CCP occurs indoors, where the position and nature of signals are 
unknown, as well as the signal weights and ages in the mixing material [2]. Many scholars and academics 
have tried various ways to solve the CPP, with the Independent component analysis (ICA) technique being 
the most well-known. 

The blind source extraction (BSE) or separation (BSS) approach is used to separate the various 
sources. The term "blind" refers to the fact that initial signals or the environment system's mixing are 
unknown or poorly understood [3]. Blind source extraction has a variety of applications in a variety of 
industries, including auditory signal processing systems, image separation techniques, the medical industry, 
and a variety of other fields. The CPP is an example of a problem that can be solved using BSE approaches. 
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Parande and Thomas [1] is provides a valuable and well-written review paper that examines various 
strategies developed by the authors to tackle the cocktail party problem and then suggests which technique is 
more beneficial based on applications. The review discovered that the problem can be solved using the 
Independent component analysis (ICA) technique, as well as principal component analysis (PCI) for 
information compression and the Wiener filter (WF) for de-noising. Stöter et al. [4] proposes an effective 
way for extracting the highest number of concurrent signals from speakers in a noisy circumstance utilizing a 
data-driven deep learning algorithm. When sufficient input and features are provided, the approach 
demonstrates that traditional neural networks (NN) can be utilized to solve and analyze the CPP. 

Traditional blind source extraction approaches are used gradient method and obtain not sufficient 
results unless get primary solutions for the initial stage. However, it is not clear to choose the suitable 
parameters values duo to blind hypothesis approach. The classical blind techniques cannot operate well when 
the fitness function is incoherent, therefore the swarm methods is used for blind source techniques to solve 
this problem or limitations [5], [6]. Swarm techniques moves towards of natureinspired with flow different 
behavior of the member such as glow worm, bees, shark, bacteria and so on [7]. Glowworm swarm technique 
is optimization way created by Krishnanand based on manner that a glowworm load a luminescence 
magnitude named luciferin [8]-[10]. The research try to outcomes are matched and compared to human 
performance. Mavaddaty and Ebrahimzadeh [5] presents a novel model for speech separation based on the 
matching of the audio-visual network with the actions of the human lip with a wide range of audio 
separation, the suggested model is used. 


2. COCKTAIL PARITY PROBLEM 

In ordinary lifetime, persons are regularly confronted to attend to definite sound signals or track 
specific chats in the middle of challenging related conversations occurrence mentioned to cocktail party 
problem (CPP) [11]. The sounds that reach a listener's ears from a specific sound source virtually never occur 
in isolation, whether the listener is at a genuine cocktail party, strolling down a bustling street, or conversing 
in a packed coffee shop. They repeatedly happen when there are other competing sources and distractions in 
the auditory surroundings of the listener. The act of organizing this soundscape into meaningful percepts is 
known as "auditory scene analysis" (ASA) [12], [13]. The ASA problem is not just for people. Similar 
challenges must be addressed by animals as well to traverse their intricate auditory environments, escape 
predators, mate, and find their infants. This includes mammals, penguins, songbirds, and fish [14], [15]. 
Engineering systems, including anything from smart phones to military communication and surveillance 
equipment, face a similar difficulty. Kumar and Jayakrishnan [6] showed and discussed this problem in 1953 
using the auditory human system to distinguish the speaker's gender, intensity, and location. The cocktail 
parity problem solution is used to build a hearing aid system for the hearing impaired. Because the human 
has two ears to determine the position and surroundings of desired sources, the CPP algorithms are utilized to 
assist the impaired human in recognizing the useful sound from the mixture. Figure 1 depicts how the brain 
detects people's voices among the mixed inputs. 


Voices Speaker-independent Detection and Neural signal 
speech separation amplification 


Figure 1. Sound recognition in the brain for various sounds 


3. BLIND SOURCE SEPARATION 

The primary goal for any blind source extraction technique is to extract or cover the original sources 
from the combination without relying. On sources or mixing media for guidance [3]. The primary block 
diagram of the blind source extraction approaches [6] is shown in Figure 2. 
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Figure 2. Blind source extraction block diagram [6] 
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The mathematical model for the blind source separation is (1). 


x(t) ai Q2 `“ Qis |] [s 
x2(t) 2 21 22 o Gons |} S2(t) (1) 
Xnr (t) Qyo1 Gnoz2 = Qnrysl LSys(t) 
The following rules regulate the system: 
x(t) =A s(t) (2) 


where A is the mixing matrix, x(t) denotes the received vector, and s(t) denotes the original vector. Because 
there is no information about the A matrix or the original signal in this system, the goal is to obtain the 
original signal s(t) from only the detected signal x. (t). The final step is to get the unmixing matrix W so that 
(2) can be applied: 


*s(t) = Wx(t) (3) 


where W is the A‘! used to estimate the independent signals' sources s(t). 

Blind source extraction techniques have numerous applications in a variety of fields, which are 

discussed below [3]-[6]: 

— Medical application: EEG signal processing extracts information from the brain that is measured by EEG 
as a mixture of electrical brain cell signals from various places with artifacts or noise to reflect brain 
activity. To isolate the useful brain signal from the mixture, independent component analysis is usually 
utilized. 

— Signal and image processing: the BSS is used to extract features from audio and video sources and find 
relevant renderings. 

— Music: there are numerous applications in this area to extract singing and contribute to a combination, such 
as karaoke. 

— Cocktail party: for the design of a hearing aid tool for disabled persons, blind source separation is 
employed to isolate each sound according to position and intensity. 


4. GLOWWORM SWAM TECHNIQUE 

Swarm techniques are established and used in different ways through current years. Like bee colony 
(BC) technique is interested from bee’s performances [15]. Genetic algorithm (GA) is motivated by genetic 
appliance [16]. Glowworm swarm technique is suggested and based on glowworms’ performance [17]. 
Matched with old-style optimization techniques, the swarm procedures are more operative and used to solve 
complex system [18], [19]. To begin, the glowworms approach is first randomly distributed over the solution 
space. Each glowworm carries a certain quantity of luciferin and identifies a basis functions solution that may 
be found in the region being searched. There is a correlation between the quantity of luciferin and the agent's 
current location. A more luminous one is equivalent to a more favorable opportunity glowworm swarm 
optimization [20]-[22] by Krishnan and Ghost mimics the glowworm's behavior of carrying a brightness 
quantity known as luciferin to communicate with its friends. Within the local-decision domain, an individual 
can only be lured by a neighbor whose luciferin brightness is greater than their own, and they can only 
migrate towards that neighbor by a probabilistic procedure [23]. When the density of a glowworm's 
neighbors is not high, the ghowworm's local-decision field will extend to discover more neighbors; otherwise, 
it will constrict to allow the swarm to fracture into tiny teams. Density of the glowworm's neighbors calculate 
the size of its local-decision field. The stages that came before it are repeated over and over again until the 
approach determinate the ultimate final decision. Glowworm technique may be broken down into the 
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following stages: the luciferin updating stage, the neighborhood selection stage, the moving probabilistic 
stage, the action stage, and the judgment radius maintain updating stage [24]. 


5. PROPOSED WORK AND RESULTS 

One of the most important tasks for the blind source extraction is to separate the mixed signals 
without any information about the original sources and the cocktail parity problem is one of the famous 
problem solved by BSS [25]-[28]. In the model presented in Figure 3, two separate microphones are used, 
each of which is a weighted sum of all sources, with the weights based on the distances between the 
microphones. The microphones and receivers are situated in a distinct location from separate and original 
sources. 


| Speaker | After Seperation 


CT Hello 
Hello Blind h 
D> \ Separation 


Namaste 


Namaste 


Figure 3. A solution of cocktail parity problem based on BSE 


The suggested algorithm's primary steps are based on a hybridization of glowworm optimization and blind 
source extraction to provide the optimal separation for the task, as illustrated below: 

Step 1. Collect and measure the received signals x 

Step 2. Centering and whitening process to simplify the difficulties in kurtosis function. Over-all, this 
function is so hard to calculate from measured signal when have large information. 


Kurt (x) = E [xt] -3 (E [x? ]} (4) 


Where E is the mean vector for the received signal. 
Step 3. Set the required glowworm factors (p, y, P, nt, lo) as shown in Table 1. 


Table 1. The parameters values for glowworm algorithm used in the proposed work based on [21] 


Parameter Value 
luciferin decay constant (p) 0.4 
luciferin enhancement constant (y) 0.6 
constant (P) 0.08 
neighbor number (nz) 5 
luciferin (lo) 5 


Step 4. Produce or generate separation vectors as initial solution and then prepare the position x(t) and 
decision radius rd(t) of the individual. 
Step 5. Compute the preliminary fitness function of each individual. 
Step 6. Compute the best location Xop and best fitness (Xop:) for the individual. 
Step 7. Calculate new luciferin l(t) values based on (5), settle the neighbors N(t) based on (6), and calculate 
compute probability P(t) based on (7) [21]. 

li (t+1) = (1-p) li (t) + y Fitness (xi (t+1)) (5) 

Ni (t) = {j : dy (t) < ri d (t); li (t) < Lj (©)} (6) 
The dij(t) is Euclidean distance and r i d(t) is the decision radius. 


Pij © = lj (t) — li (t) NKENi(t) lk (t) — li (©) (7) 
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Step 8. Calculate the new position x (t + 1) and new decision radius rd(t + 1) of individuals based on (8). 


“(t+ =x(t)+s [=] 


PORRO (8) 


The I-I is the norm and s is the step. 
Step 9. Compute the best fitness. If the new fitness is better than previous, then will updated. For every 
updating the radius is governed by (9): 

ri(t +1) = min {rs, max{0,ri(t) + B(nm — IN.(e)D}} (9) 
The rs is the sensory radius. Figure 4 shows the fleshly radius and judgement radius of glowworm i. 


Step 10. Check the stopping criteria. Figure 5 shows the main flowchart for the proposed work. To catch the 


estimate original signals based on received mixtures, the blind technique approach is employed, in order to 
solve the Cocktail Party Problem. 


Figure 4. Sensory radius and decision 


positions x(t) and them calculatethe fitness fumction. 


Updating the luciferin Eq. (4), approvethe neighbors as Eq. (5), and calculate the 
probability ofmovement as Eq. (6) 


Calculate new position and new decision radius of each individual 


Compute the fitness of all mdividual and extract the optimal one. 


Yes 


Figure 5. Flowchart for the proposed work 
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Also, some simple assumptions must be met, such as that the original sources are statistically 
independent and that the mixing medium matrix is a square. As indicated in Figure 6, two input auditory 
actual data files are acquired from [29]. The square matrix mixes these input audios at random to produce 
mixture signals, as shown in Figure 7. The proposed approach is used to extract and isolate the original 
signals from each other after the centering and whitening processes, as shown in Figure 8. 

The preceding numbers demonstrate that the separation method, which is based on the stone BSE 
technique, is quite efficient, as evidenced by listening to the separated sound files. Two blind source 
separation approaches (rapid ICA and JADE) are also used to test the proposed approach's validity against 
other methods. Table 1 illustrates the performance criteria for several types of BSS; the suggested work is the 
best in this table since it has a high SNR and a short convergence time assessed in MATLAB using the (tic- 
toc) function. The similarity between original and extracted signals is calculated using the MATLAB 
‘corrcoef' function, as shown in the last column of Table 2. The values in the diagonal matrix (0,0) & (1,1) 
represent the autocorrelation function of the source with itself, so the value should be equal '1', but other 
values in the correlation matrix for proposed approach closed to '1' with high accuracy when compared to 
FICA and JADE different blind source extraction techniques. 


Inpa Simal ( 


lapan Seamal (2) 
1 t 
05 05 
S a S ‘ 
ct e 
2 2 
s3 KF) 
ʻa a 
a as i ~ 15 2 25 eio z 1 5 5 
a 05 is 2 25 *10" 
Taw Tane 


Figure 6. The utilization of two real input signals 
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Figure 8. Original and extracted signals using the suggested method, vertically shifted for comparison 
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Table 2. The compression results 


Proposed Work FICA JADE 
Signal SNR Elapsed correlation SNR Elapsed correlation SNR Elapsed correlation 
time matrix (r) time matrix (r) time matrix (r) 
Signal 7.887 0.47187 7.108 0.994963 5.988 0.890705 
a) l 1 0.999 l 1 ea [ 1 0. Aa 
Signal 16.138 0.998 1 12.123 0.899 1 13.563 0.971 1 
2) 
Mean 12.0125 9.6155 9.7755 


6. CONCLUSION 

To propose the good optimization approach for blind source extraction techniques, an innovative 
step change approach for the glowworm swarm technique is used to enhance the separation process. The 
experiment results show the GSO has robust universal search capability and quicker convergence. thus, the 
this algorithm is more appropriate for BSE. Glowworm optimization technique is presented as a new 
technique for solving the cocktail parity problem, and it is used as a first in this field, where practically all 
cocktail parity problems are solved using classic source separation techniques. This technique is a better 
choice and more effective than the other blind source extraction strategies now in use. In addition, this 
research proposes a comparison examination of three blind strategies for solving the problem and evaluating 
their performance. The goal of this research is to employ it in the ‘computational auditory scene analysis’ 
(CASA) system in the future. Soft computing techniques can also be utilized to improve the separation 
process and isolate high-performance signals. Finally, the findings show that the suggested approach is 
capable of efficiently separating the original source while requiring less convergence time. 
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